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A system and method for selecting and presenting personally 
targeted entities such as advertising, coupons, products and 
information content, based on tracking observed behavior on 
a user-by-user basis and utilizing an adaptive vector space 
representation for both information and behavior. The sys- 
tem matches users to entities in a manner that improves with 
increased operation and observation of user behavior. User 
behavior and entities (ads, coupons, products) and informa- 
tion (text) are all represented as content vectors in a unified 
vector space. The system is based on an information repre- 
sentation called content vectors that utilizes a constrained 
self organization learning technique to learn the relation- 
ships between symbols (typically words in unstructured 
text). Users and entities are each represented as content 
vectors. 
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NAME 


TYPICAL 


VALUE RANGE 


DESCRIPTION 


threshold 


0.5 


0.25-0.75 


Threshold bv which auarv i/Arfar 

1111 waiiviw J 1 1 Wwl Ul 

will be used instead of profile 
vector. 




0.1 


0.05-0.5 


Learning rate for profile update. 




0.01 


0.005-0.1 


Learning rate for ad update. 




1 .0e-06 


1.0e-05 - 
1 .Oe-07 


Update rate for query universe 
estimate of the mean. 


CprsOb 


1.0e-06 


l.Oe-05 - 
1 .Oe-07 


Update rate for profile universe 
estimate of the mean. 




2.5e-05 


1.0e-04 - 
1.0e-06 


Update rate for ad universe 
estimate of the mean. 




0.01 


0.001 -0.5 


Forgetting factor for profiles. 




0.01 


0.001 -0.5 


Forgetting factors for ads. 


tied 


0 


Application 
dependent 


Mean of ads. 




0 


Application 
dependent 


Mean of profiles. 




0 


Application 
dependent 


Mean of queries. 
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SYSTEM AND METHOD FOR OPTIMAL 
ADAPTIVE MATCHING OF USERS TO 
MOST RELEVANT ENTITY AND 
INFORMATION IN REAL-TIME 

BACKGROUND OF THE INVENTION 

1. Field of Invention 

The invention relates to associating entities and informa- 
tion with behavior More particularly, the invention relates to 
a system and a process for targeting and delivering 
advertising, coupons, products, or informational content to 
users based upon observed behavior. 

2. Description of the Related Technology 
The widespread availability of the World Wide Web (web) 

and Internet services has resulted in a unique set of adver- 
tising opportunities. Unlike conventional "broadcast" 
media, such as television and radio, the web is a "narrow- 
cast" medium that allows the user to have higher levels of 
control over the information they receive. Since users can 
control the retrieval of information, the advertising tech- 
niques utilized in the conventional broadcast model has 
become less effective and can alienate potential customers as 
a result of the "shotgun" effect. The potential of selectively 
targeting advertisements on a user-by-user basis has been 
unrealized due to the difficulty in performing meaningful 
targeting of customers. The current generation of web ad 
selection engines utilize a number of partially successful 
techniques to target customers. Typically, the effectiveness 
of these techniques is measured in terms of audience 
response rates. Audience response, also called 
"clickthrough", is evaluated by counting the number of users 
that click on a "banner advertisement" contained on a web 
page which is presented to the user. Clicking on the banner 
advertisement typically takes the user to the web site of the 
advertiser where additional information about a product or 
service is provided. In the current Internet advertising 
environment, clickthrough is the best measure of the effec- 
tiveness of advertising techniques. Consequently, the value 
of advertising is directly related to the effectiveness of the 40 
ad. Therefore, the maximization of clickthrough is of para- 
mount importance for most web sites for both practical and 
financial reasons. 

Some current banner advertising selection techniques are 
listed in the tables below. These techniques are divided into 
two classes. The first class of advertising selection 
techniques, shown in Table 1, use simplistic first generation 
techniques that are based only on static a priori information. 
The second class of advertising selection techniques, shown 
in Table 2, utilizes some of the techniques used in the more 
sophisticated second generation ad selection systems which 
may take into account some measure of user behavior, such 
as a user query, or make use of predisclosed user prefer- 
ences. However, these more sophisticated objectives often 
only complicate the problem. In Table 1 and Table 2 certain 
disadvantages of each method are given, however, the 
Tables are not meant to be all inclusive so not all disadvan- 
tages may be shown. 



TABLE 1 -continued 



Method 



First Generation Ad Targeting Techniques. 
Disadvantage 



Implies user behavior is related to usage domain 

Provides poor clickthrough 

Implies user behavior is related to usage browser 

type/version 

Provides poor clickthrough 

Implies user behavior is related to system type 

Provides poor clickthrough 

Implies user behavior is related to usage domain 

Provides poor clickthrough 

Requires up-to-date knowledge base of IP address 

versus location 

Provides poor clickthrough 

Provides no sfgnifirant user insight 

Requires up-to-date knowledge base of SIC code 

versus IP address 

Provides somewhat improved user clickthrough 
Provides no significant user insight 
Requires up-to-date knowledge base of SIC code 
versus IP address 

Provides no significant user insight 
Implies user behavior is related to size 
of company 

Provides poor clickthrough 

Requires maintenance of knowledge base 

Costly to maintain 

Cannot incorporate observed user behavior 



Browser type 

System type 

Service provider 

Geography/ 
location 

Site SIC Code 
Company size 



Knowledge base 
techniques 



30 



35 



TABLE 2 



45 



50 



Method 


Disadvantage 


User registration 


Requires user willingness to disclose information 


information 


Requires many judgments on products for 




effective operation 




Requires a substantial preexisting judgment 




database 


Search key words 


Misses ads that are conceptually close but do not 




contain key word 




Restricts available inventory 




Requires careful choice of key terms 


Hand targeting of 


Very labor intensive; does not scale well 


ads to sites or 


Humans Dot good at manual targeting for large 


pages 


numbers of ads 


Hand analysis of 


Very labor intensive; does not scale well 


site or locations 


Humans not good at manual analysis of large 


in a site 


numbers of sites 


Random 


Poor response rates 


presentation 


Alienates users 


Date/time 


Poor response rates 


Hand analyzed 


Very labor intensive 


sites 


High cost 



TABLE 1 



Method 



First Generation Ad Targeting Techniques. 
Disadvantage 



Domain name of 
user browser 



Provides no significant user insight; Does not 
account for user behavior 



In the current web environment, users utilize search 
services to find relevant or interesting information. These 
search services provide a potential focus for the identifica- 
tion of user behavior as defined by the searches they execute, 
the web pages they view and positions in the directory 
hierarchy they visit. However, existing banner advertising 
selection techniques fail to analyze this behavior when 
selecting an advertisement These search services provide an 
60 opportunity for presentation of user specific advertising. 
The conventional techniques shown in Table 1 and Table 
2, use hand coded associations and lists to perform customer 
"targeting." Certain techniques may use only the current 
user query as part of ad targeting. Each of the methods 
65 shown in Tables 1 and 2 may be associated with five general 
categories of advertisement selection techniques. These cat- 
egories include: 
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(t) rule-based categorization; that an advertiser cannot automatically target an ad to the 

(ii) keyword based ad selection; best pages within a site. Typically, advertisers want to 

(iii) assigning ads to individual web pages or sites; display different ads for each page in a site. Another problem 

(iv) assigning ads to branches in hierarchical organiza- with assigning ads to web sites is that the ads served to the 
tions of pages; and 5 user are not context dependent in that they fail to utilize user 

(v) collaborative filtering. specific information. 

A description of each category is provided below. Collaborative Filtering Techniques 
Rule Based Ad Selection Collaborative filtering (CF) techniques have been pro- 
Rule based ad selection uses available information and posed for the problem of selectively targeting specific ads to 
rules to select an appropriate ad. This technique can be 10 web users. The CF approach requires large numbers of users 
effective if large numbers of rules are manually coded and to formally register with the system and make preference 
the rule developer has a deep understanding of the problem judgments about the quality of the ads, coupons and infor- 
domain. However, one problem with this system is that mation content they receive. Under certain limited 
human intellectual effort is required to write or maintain the circumstances, this technique may provide some degree of 
rules. The development of a rule base may be very expensive 15 effective matching in small scale tests. However, for large 
and time consuming. tasks, such as found on the Internet, the algorithms become 
Additionally, the rules for advertisement selection are both computationally intractable and impractical, 
limited to available variables with discrete values and pro- Additionally, the CF techniques perform best when the 
vide "brittle'* decision boundaries. These decision bound- universe of entities are static rather than time evolving. In 
aries are typically binary and are frequently mishandled. 20 the Internet market, static content is the exception, not the 
Rule based ad selection requires extensive knowledge norm. Furthermore, the utility of the CF approach is directly 
about the targeted operating domain. Even with computer- related to the numbers of users that have made preference 
aided tools, a knowledge engineer is required to develop the judgments. Typically, users are reticent to spend a substan- 
rules and administer the system. Furthermore, humans have rial amount of effort which is required to make these 
demonstrated time and again, that they are poor at encoding 25 judgments both because of personal time limitations and 
rules. This observation is particularly true when large num- concerns about privacy. As such, data acquisition becomes a 
bers of variables are encompassed within the scope of the major problem for the advertiser. 

problem being modeled. CF solutions typically require a knowledge engineer to 
Keyword Based Ad Selection implement a system of sparse vectors of a very high 
In keyword based ad selection systems, the ads are 30 dimension, where the dimension of the problem space is 
selected on the basis of one or more user provided words. equal to the number of entities under consideration. 
When an observed user behavior (typically a user issued However, this solution has been found to be computationally 
query) contains a known keyword, one of the ads, which is intractable. Another problem with CF techniques is that the 
manually associated with the keyword, is selected for dis- ad selection software is examining for orthogonal relation- 
play. This technique provides good response rates for the 35 ships which requires a substantial number of statistically 
keywords chosen. However, a major drawback with this complicated steps. The ad selection software must find close 
approach is the system administrator must manually choose neighbors in the vector space and suggesting coordinates not 
the keywords associated with each ad. This technique, based in common between the neighbors. 

on intellectual effort and deep knowledge of the ad-specific The existing ad selection systems, particularly in coupon 

domain, is time consuming and error prone. Additionally, the 40 and print media advertising, fail to address the interactive 

"inventory" of keywords at a site quickly becomes sold out. nature of the Internet and electronic commerce. Advertisers 

Lastly, with keyword based ad selection techniques, the ad need to be able to identify users of specific interests, track 

selection process does not account for previous user behav- those interests over time and disseminate, in a highly selec- 

ior. Tbe ad selector only uses a set of human-selected tive way, information, advertising, coupons and product 

keywords in the current inventory based upon the current 45 offerings that will be of interest to the user. Additionally, 

search query. Synonyms of user provided words are not advertisers need to track user interests and behavior in a 

automatically targeted without a thesaurus or synonym list. real-time manner. 

Assigning Ads To Individual Pages Therefore, advertisers need a system which is sensitive to 

Ads are sometimes manually targeted to individual web user behavior for targeting advertising, coupons, products 

pages. This method requires human intellectual effort to 50 and information content. This system should enable the 

match an advertisement to a web page. However, such effort targeted marketing in real-time and to a granularity of an 

is usually prohibitive in large scale sites containing thou- individual user as well as groups of users that have similar 

sands of web pages. behavioral characteristics. 

Assigning Ads to Branches with Hierarchies of Pages im/rNmnKI 

Ads are sometimes manually targeted to hierarchies or 55 SUMMARY OF THE INVENTION 

categorizations of pages called page ontologies. This tech- The present invention includes a system and method for 

nique may provide effective performance if the human delivering targeted entities, such as advertising, coupons, 

making the assignment of the ad to a branch of the hierarchy products and information content, to users based on tracking 

has a good feel for the content of the site and understanding observed behavior. The system model is an adaptive vector 

of the intended viewer. However, human intellectual effort is 60 space representation for entities, information and behavior, 

required to select the appropriate branch or category for the The system matches users to entities in a manner that 

ad. improves with increased operation and observation of user 

Assigning Ads To Web Sites behavior. 

Ads are sometimes manually targeted to all of the pages The present invention uses an informational representa- 

in a web site. However, as was seen with many of the other 65 tion called entity or content vectors. The content vectors are 

systems, human intellectual effort is required to select the a high dimensional, real vector representation that encodes 

appropriate ad for tbe web site. This method is inflexible in contextual similarity. The system utilizes a constrained 
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self-organization learning algorithm to learn the relation- FIG. 4 is a block diagram illustrating the inputs and 

ships between symbols which are typically words in unstmc- outputs of the hash table creation process of the word 

tured text. This learning algorithm discovers relationships content vector learning module shown in FIG. 2. 

between words or symbols from a set of training examples FIG. 5 is a block diagram illustrating the functions of the 
for a domain of interest and assigns a unique content vector 5 stem generating process of the word content vector learning 

to each symbol. The resulting representation is such that module shown in FIG. 2. 

symbols that appear in a similar context will possess content tnr-£- ui u j- •» . t _ 

vectors that point in similar directions in this high dimen- t ™" 6 15 f b <** dia&ram ^8 stcm frequency 

sional contenYvector space. calculatl0D P roccss of the word content vector 

„, , , r . , learning module shown in FIG. 2. 

Web pages, banner advertisements, coupons, and any io . . .„ . ,.„. 

other textual or symbolic information, are represented as a nG / 7 V f"^ the different learning 

summary content vector by forming the normalized ^voluUonal windows of the word content vector learning 

weighted vector sum of the content vectors of the words modulc shown m nG 2 

(symbols) in the page or description of the product or FIG * 8 ^ a dia gram illustrating the vector manipulation 
service. These summary content vectors have the property is process for adapting a stem vector by the vector of neigh- 
that pages that discuss similar information have summary boring stems for a convolutional window shown in FIG. 7. 
content vectors that point in similar directions. FIG. 9 is a block diagram illustrating the inputs and 
The actions of each user are represented by profile vec- outputs of the entity vector creation process of the session 
tors. Typical user actions include, but are not limited to: compiler shown in FIG. 2. 

issuing queries requesting or reading pages, visiting Internet 20 FIG. 10a is a block diagram illustrating the inputs and 

sites, responding to ads, and redeeming coupons and pur- outputs of the process for creating the highspeed files shown 

chasing products. A user's profile vector is configured to in FIG. 2. 

track observed behavior by using the content vector repre- FIG. 10b is a block diagram illustrating the inputs and 

sentation of the associated actions. For example, when the outputs of the process for generating the session control file 
user requests a page of information, the user's profile vector 25 shown in FIG. 2. 

is adjusted based upon the entity vector of the requested FIG. 11 is a block diagram illustrating the inputs and 

P 3 ^" outputs of the process for transforming a page request into 

The user profile vector is utilized as the mechanism for an content vector which is performed by the real time ad 

selecting the best entity (e.g., ad, coupon, product or sever module shown in FIG 2 

information) to present to the user. The user profile vector is FIG. 12 is a block diagram illustrating the inputs and 

compared to each acUveenUty vector, and the closest entity outputs of ^ for transforming a q * eiy int0 * OOBtBat 

content vector » selected for display to the user. vector which is performed by the real time ad server module 

The present system also performs the optimal adaptive shown in FIG. 2. 
positioning of entity vectors. If a user is presented a banner 3S n G . 13 is a bloc* diagram mustrating the functions of the 

ad and requests more irformation by clicking the mouse on ad sdection process performed by the real time ad server 

it, the entity vector for the ad is adjusted using the user's shown in FIG. 2. 

profile vector. Additionally, the user profile vector is CTr . 1/4 • '., ' . .. .„ . . , , . 

adjusted using the ad content vector. This mutual adaptation " G - 14 " a **** diagram ^^ a S the functions of the 

allows optimum positioning of ads relative to users An profiJe ^ ctor ada P'i t,on P rooess o£ ^ P rofile vec,or "P^ 

... , b „. 40 server shown in FIG. 2. 

In this system, the user profile vectors evolve through a „~ ■ . ,. ... „ . . . 

user's usage of the system. Clustering and classificXn n f 1 ?" 15 f f. a W °* " lu ^g te mpos and the 

techniques are apphed to the adapted profile vector set to ^ I H" ^ <tag , f? u™^ 

discover behavior patterns that are utilized for focused « labonsM P. decover y and «»l3" ™>*d* shown in FIG. 

marketing campaigns. 4S 

Likewise, ad and coupon entity vectors evolve as a result nG ™ * ^ atin S ^ of * c 

of user behavior. THese evolved vectors encode summary by ^ ^ VCCt ° r "P datC *™ modu1 ' 

characteristics of the users who clicked on the ad or f OWU m ™ G : f UpdatCS thc P rofilc vcctor databasc ^ 

redeemed the coupon. These summary characteristics are d&mo &*P**c information. 

used to modify product positioning and discover the char- 50 RG - 17 B a block dia 5 ram i^trating the functions of the 

acteristics of the users who are interested in the product or P 1 ™*^ b Y which thc cntit y vc ctor update server module 

service. shown in FIG. 2 adapts entity vectors. 

Trie system is fully automated and does not require any FIG " 18 fa a flowchart illustrating the vector adaptation 

rules for operation and is based on self organizing neural P 100 ^ of the profile vector update server and the entity 

network techniques. Additionally, the system operates using ss vcctor ^ scrvcr ^ own m FIG * 2 - 

tacit user observation and does not require explicit user FIGS. 19a and 19b are the equation flow diagrams for the 

judgments for correct operation. fixed rate learning algorithm performed by the profile vec- 

BRIEF DESCRIPTION OF THE DRAWINGS s^wn^RG^ "* ^ ^ 

FIG. 1 is a functional block diagram illustrating a com- w FIG. 20 is a chart containing a summary of sample run • 

puter environment that may be associated with the present time parameters for the fixed rate learning algorithm shown 

invention. in iq GS . i9a and 19b 
FIG. 2 is a functional block diagram illustrating the 

processing and data components of the present invention. DETAILED DESCRIPTION OF THE 

FIG. 3 is a block diagram illustrating the inputs and 65 PREFERRED EMBODIMENTS 

outputs of the preprocessing process of the word content The following presents a detailed description of certain 

vector learning module shown in FIG. 2. specific embodiments of the present invention. However, the 
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invention can be embodied in a multitude of different ways vectors. Thus informational items having similar meanings 

as defined and covered by the claims. In this description, have closely aligned vectors, while informational items 

reference is made to the drawings wherein like parts are having dissimilar meanings have approximately orthogonal 

designated with like numerals throughout. vectors. 

FIG. 1 is a block diagram illustrating a computer envi- 5 In the following description, the words "content vectors" 

ronment associated with the present invention. A client and "entity vectors 1 * are used synonomously. Both terms 

computer 100 has a monitor 102 and a processing unit 103. refer to a vector that is associated with certain content 

The processing unit 103 includes a data storage. information, ad, coupon, solicitation, or product, for 

The client computer 100 stores information that may be instance, 
exported to other computing devices through a network 104. 10 The following description is divided into the sections: (1) 
The network 104 may include any type of electronically System Component Overview, (2) Word Content Vector 
connected group of computers including, for instance, the Learning Module, (3) Session Compiler Module, (4) Real- 
following networks: Internet, Intranet, Local Area Networks Time Ad Server Module, (5) Profile Vector Update Server 
(LAN) or Wide Area Networks (WAN). In addition, the Module, (6) Entity Vector Update Server Module, (7) Profile 
connectivity to the network may be, for example, remote 15 and Entity Adaptation Algorithms, (8) Calculation of Entity 
modem, Ethernet (IEEE 802.3), Token Ring (IEEE 802.5), Inventory Weighting (Cost Model), and (9) Relationship 
Fiber Distributed Datalink Interface (FDDI) or Asynchro- Discovery and Analysis Processing Module, 
nous Transfer Mode (ATM). Note that computing devices 

may be desktop, server, portable, hand-held, set-top, or any Svstcm Component Overview 

other desired type of configuration. 20 Beginning the discussion by reference to the bottom of 

When a remote user requests information from the net- FIG. 2, a word content vector learning module 200 creates 

work 104, the client computer 100 displays an advertisement a set of word content vectors from a domain relevant training 

with the requested information. Based on a request from the corpus 202, a stop list file 204, and a phrase list file 206. The 

client computer 100, a server 106 may select an appropriate 25 stop list file 204 contains a list of words, word fragments and 

advertisement to display to the user based upon the user's phrases for the word content vector learning module 200 to 

observed behavior. ignore. The phrase list file 206 is a list of words that the word 

In one implementation, the server 106 includes a gateway content vector learning module 200 groups together. The 

which is connected to a WAN 108. The WAN 108 has a training corpus 202 is the set of informational objects used 

plurality of network servers 110. One of the network servers 30 to train the word vectors. The functions of the training 

110 is connected to a LAN 112 comprising a plurality of corpus 202, the stop list file 204, and the phrase list file 206 

computers 114. The system software that selects the adver- are further described below. 

tisement may be located on one the network servers 110 or When the word content learning process is complete, 

another computer in the network 104. In one embodiment of there is one word content vector in a word vector database 

the invention, the advertisement selection software executes 35 208 for each word in the training corpus 202 that is not in 

in part on a plurality of the network servers 110. In another the stop list 204. These word content vectors have the 

embodiment of the invention, the advertisement selection property that words that are used in a similar context have 

software executes on a plurality of the computers 114 on the content vectors that point in similar directions in the high 

LAN 112. In yet another embodiment of the invention, the dimensional content vector space. The word content vectors 

advertisement selection software resides on the client com- ^ serve as the basis for the ad selection operations performed 

puter 100. It is important to understand that the advertise- by the present system during its operation. The word content 

ment selection system of the present invention may be vector learning process is performed once as part of the 

hosted on any computing device so long as a communication initialization process. However, these vectors can be 

pathway exists between the advertisement selection soft- updated if desired. Normally, the word content vectors are 

ware and the requester of the advertisement FIG. 2 is a 4S static during normal operation of the invention, since the 

functional block diagram illustrating the processing and data meaning of words in the English language evolves slowly 

components of the present invention. The following descrip- over time. 

lion focuses on the system's ability to deliver advertising, a session compiler module 210 takes a set of session 

specifically banner advertising over the Internet. However, control information, in the form of structured text from a 

the techniques utilized in this invention are general in nature 50 session description database 212, and transforms the text 

and can be applied to any type of entity or information. into control commands for a real-time ad server (RTAS) 

Furthermore, the network environment need not be the module 214 in a session control file 215. Typical information 

Internet as indicated previously. processed during the compilation process includes the num- 

The present invention employs vector modeling of infor- ber of active ads, the number of allowable impressions for 

mation so as to identify relationships among the informa- 55 each ad, start and stop times, and dates for showing each ad. 

tional items or entity vectors. A process for making and The session compiler module 212 may be omitted from the 

using entity vectors is described in the U.S. Pat. No. 5,619, system if such functionality is not needed. 

709 filed Nov. 21, 1995, entitled "SYSTEM AND The present invention then computes an initial entity 

METHOD OF CONTEXT VECTOR GENERATION AND vector for each ad or entity in the system. In most cases, this 

RETRIEVAL" to Caid, et al. 50 entity vector is based on a textual description of the entity 

This entity vector generation scheme is designed to pro- which is located in an entity description database 216. For 

duce content vectors. In an application to words, for instance, if the entity is an ad for a car, the words in the 

instance, the scheme generates one vector per word, which advertisement are used to determine the entity vector. The 

represents the relative proximity of a meaning or content entity vector is computed as a normalized weighted sum of 

among words or documents in a quantitative and geometric 65 the word content vectors that are in the advertisement. A 

manner. The system then creates an entity vector based upon textual description of any item can be used as the initial 

the words in the entity and their associated word content condition for the entity vector. Since entity vectors can 



06/20/2003, EAST Version: 1.04.0000 



6,134,532 

9 10 

evolve during system operation, the initial condition of the The EVUS module 228 performs a nearly identical action 

entity vector can be null, and no textual description of the as the PVUS module 224, but performs the operation on the 

entity is required for system operation. However, in normal entity vectors for advertisements. Each entity vector is 
operation, this evohiuoa may require substantial time if stored in an entity vector database 230. R>r example, if the 

system usage rate is low and as a consequence, the preferred 5 user dicfo on ^ a( j, the entity vector for the user selected 

embodiment would start with any reasonable text descrip- ad ^ movcd a small stcp m tbc diction 0 f the user's profile 

tion as an initial condition. vector 

In the present invention, more ads can be sold in a A . . , - . . . rrtr ^A\ j i 

*u — *• Z~ c . »*u _ • „ 4 „ A re lauonsnip discovery and analysis (RDA) module 236 

thematic region or interest than in previous systems since all - r . / t J v r . , , 

ads are selected on the basis of vector comparisons and not in P^™ 5 P^^g actions on the contents of both the 

exact matches, thus increasing the available inventory. For 10 P™ file vect ° r dat ^ast 226 and the entity vector database 

example, the provider of the present invention could sell an to * nl * patterns and trends. The RDA module 

ad slot based upon the regions of the vector hyperspace close 236 utilizes both cliistcring of the entity vectors and profile 

to the words in the ad and not just a selected set of words. vcctors to ldcnUfv uscful bchavior P atterns - 

The entity vector approach also allows administrators to 15 li should be noted that the adaptation of the entity vectors 
effectively setup ad definitions without special skills or fe not required for system operation. Indeed, when adapta- 
domain knowledge, thus improving overall system perfor- tion * enablcd > il allows the entity vector of the product to 
mance without the requirement for domain experts. The ' ldrift " toward users who have received the ad. In some 
present invention automatically targets an ad to the best applications, information content delivery systems in 
pages within a site, not just the site as a whole. As a 20 particular, this drift is not desirable. Clearly, the information 
consequence of this feature, different ads may be associated content of a news story is deterministic and should not 
with the same page at a site. In addition, the present change based on who reads the story. As such, the present 
invention automatically selects the most relevant pages invention has provisions to disable the adaptation of entity 
within a hierarchy or category for a particular ad. vectors. However, in many applications, such as Internet 
Alternatively, the system can be configured to serve the best „ advertising, the drift of the vectors is the mechanism 
ad independent of any ontology that might be present and whereby the system discovers patterns and trends, 
based upon only on matching to observed user behavior. In the preferred embodiment, described herein, the soft- 
la the present invention, the ads served to the user are ware programs are written in the C, C++ and TCL program- 
context dependent in that they depend on both the contents min S language and run under the well-known UNIX oper- 
of the page, the current user action and the historical user 3 q ami S system. C, C++, and TCL are industry standard 
profile. The entity vector approach provides a solution which programming languages for which many commercial com- 
is compact and adaptive. The entity vector representation pilers can be used to create executable code. However, the 
requires a much smaller amount of data for training as present invention may be run on a variety of conventional 
compared to the collaborative filtering approaches. hardware and software platforms. The software system 
Additionally, the multidimensional entity vector represents- 35 described herein may be distributed across different sub- 
tion of the present system leads to superior generalization systems running on different hardware platforms. The corn- 
capabilities as compared to the collaborative filtering ponents may communicate using messages transmitted 
approaches. using any network protocol, such as Transmission Control 
Referring again to FIG. 2, the real-time ad server (RTAS) Protocol/Internet Protocol (TCP/IP) and RCP. The specific 
module 214 is the engine which observes the user behavior, 40 P lalform mat hosts these modules and files is unimportant to 
and based upon that behavior, selects the appropriate ad to invention provided that the host computing environment 
present to the user computer 219 located at the top of the has sufficient processing power. Additionally, each process- 
figure. The RTAS module 214 uses the current user action, and/or ^ component may be hosted on a separate 
the user's profile vector and the active set of entity vectors platform as long as paths of connectivity are provided via 
in selecting the appropriate advertisement. The RTAS mod- 45 anv sufficiently robust communication protocol, 
ule 214 uses vector proximity calculations such as inner The present invention includes six principle processing 
product to determine the optimal ad to display to the user. modules as follows: the word content vector learning mod- 
However, other distance metrics may also be employed If ule 200, the session compiler module 210, the real-time ad 
the RTAS module 214 detects that a user has performed an server module 214, the entity vector update server module 
"adaptation event** on the client computer 219 such as 50 228, the profile vector update server module 224, and the 
issuing a query 217, requesting a page 218, purchasing a relationship discovery and analysis module 236. Below, the 
product or clicking on an ad 222, the RTAS module 214 aforementioned components are described in further detail, 
sends an update message to the a profile update server 

(PVUS) module 224 or an entity update server (EVUS) Vector Learning Module 

module 228. The RTAS module 214 utilizes a static content 55 The word content vector learning module 200 (shown in 

vector database 234 and a high speed file 232 to hold FIG. 2) is implemented as an application programming 

working values of the entity or ad vectors. Upon the selec- interface. The application programming interface may be 

tion of the proper entity, the RTAS module 214 sends the part of a computer program library. Advertisers use the 

selected ad to the client computer 219. procedures provided by this library to create a stand alone 

The PVUS module 224 adapts the user profile vectors in 60 executable computer program capable of running on any 

response to a request from the RTAS module 214. A profile platform of sufficient processing power. This program per- 

vector for each user is stored in a profile vector database 226. forms the content vector learning process which in one 

This adaptation process moves the user profile vector in the embodiment, is further described in the Caid patent, 

direction of the entity vector associated with the adaptation Additionally, this program writes the output of entity vector 

event For instance, if the user cricks on an ad, the user 65 learning into data structures which are then utilized by (he 

profile vector is moved a small step in the direction of the session compiler module 210 to create files optimized for 

entity vector for the ad. runtime efficiency. 
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The training corpus 202 which includes a large number of stemming process removes common suffixes from each of 

documents containing unstructured text or symbols is the words in the training corpus 202. 

selected and processed through the content vector learning FIG. 4 is a block diagram illustrating the inputs and 

algorithm. The resulting objects produced through the con- outputs of the process for creating a stem hash table 400. The 

tent vector learning process are then adapted for use by the 5 stem hash table 400 is created by a stem hash table creation 

other system components. module 402. The stem hash table creation module 402 is a 

FIG. 3 is a block diagram illustrating the inputs and sub-component of the word content vector learning module 

outputs of a preprocessing module 300 of the word content 5°° F ° r ^ ^ 302 1D *J* mte ™ d *^£* C ?°t' 

learning module 200 shown in FIG. 2. As part of the da 'f 404 are made m a / tem hash table 400 11115 hash 

pre-processing the word content vector learning performs 10 ^ ble also contains pointers for each stem to a unique data 

the foDowing functions: element mat consists of a stem text stnn 2> stem statistics, a 

„ stem content vector, and a correction content vector that is 

. orms input. used as temporary storage to accumulate the change vector 

Referring to FIG. 3, the requisite input files for the word to ^ applied to the stem content vector ^ of the 

content vector learning module 200 are assembled. The three learning process 

input files are the training corpus 202, the stop list file 204 15 f, g . 5 ^ , block & mustralillg ,he random number 

and the phrase list file 206. . - . . .. . ° . c 
. , y . t , , , . generating process for creating initial conditions for the stem 
In the situation where the symbols of interest are a series vectors. The initial conditions for each stem data item in the 
of words 301, the training corpus 202 includes a set of stem hash table 400 is created by a uniform pseudorandom 
unstructured free form text documents that cover the domain ^ number generator (UPNG) module 500. The UPNG module 
of interest For example, if the domain of interest is outdoor 500 gcncrates an initial random content vector and stores 
sports such as hunting or fishing, an appropriate training mis vcctor m lhc stcm vcctQr ^ Tjp NG mo dule 
corpus might be 3 years of articles from Field and Stream 500 generates initial conditions for stem vectors through the 
magazine and the outdoor section of the local newspaper. ^ of a long rfod pseudorandom number generator which 
This text is typically a "flat" ASCII format with any type- ^ creates a flat distribution random number 502. Any random 
setting or formatting information removed. number generator can be ^ assuming the period of 
The stop list file 204 (shown m FIGS. 2 and 3) is repetition is sufficiently long (more than 100,000,000 
developed through a combination of human intellectual cycles). Using the flat distribution random number 502, a 
effort and conventional statistical techniques, that contains a transformer module 504, creates a Gaussian random variate 
list of words, word fragments and phrases for the word 3Q 506 mat has me property of zero mean and unit variance. An 
content vector learning module 200 to ignore. For the installation module 508 assigns the first Gaussian random 
content vector word learning, as with conventional statistical var iate to the first element of the current stem content vector, 
information retrieval techniques, many words are meaning- a second random variate is then assigned to the second 
less. For instance, "AND", "NOT", "BUT", and "TOE" are element of the stem content vector. This process of assigning 
words that convey litde meaning in terms of detennining the 35 unique Gaussian distributed random numbers is repeated 
thematic content of a set of text. As a consequence, these until all elements of all stem content vectors of all the stems 
terms should be ignored. The stop list file 204 is a flat ASCII m the stem hash table 400 have been populated, 
file that contains an exhaustive and enumerated list of words FIG. 6 is a block diagram illustrating the stem frequency 
to ignore with one entry per line. Hie typical length of this statistics calculation process of the content vector learning 
file for conventional applications is approximately 250-350 ^ module m p, G 2 A processing module 600 acquires 
of the statistically most frequently occurring words in the me stem freque ncy statistics from the intermediate stem file 
training corpus 202. 304. The processing module 600 is a sub-component of the 
The phrase list file 206 (shown in FIGS. 2 and 3) contains word content vector learning module 200. The processing 
the list of word sequences that are to be treated as a single module 600 reads the intermediate stem file 304 and corn- 
symbol. Using this file, the word content vector learning 45 pu tes the number of occurrences of each stem 302 in each 
module 200 treats certain phrases as single symbols. For document in the intermediate stem file 304. The processing 
instance, "San Diego" should be considered as a single module 600 then stores these statistics in the stem statistic 
symbol. The contents of the phrase list 206 are determined section of each data item 404 in the stem hash table 400. 
through intellectual effort, statistical techniques, linguistic FIG. 7 is a diagram of the different learning convolutional 
processing or any combination thereof. The phrase list file 50 ^ dov/s of me enti(y selection system shown in FIG. 2. A 
206 is not required for correct operation of the word content of ^makm t0 the mitial ^^0^ of the stem 
vector learning module 200, but the use of a meaningful veclors b^d on word co-occurrences are computed 
phrase list does improve system performance. The phrase for each of te stem content vectors in me stem hash lable 
list 206 is a flat ASCII file with one phrase per line in the file. 400 presem syslem modines the stem content vectors 
2. Performs word content vector learning. 55 based on the words that co-occur with a stem using a 
Once the training corpus 202, stop list file 204 and phase convolutional co-occurrence window. FIG. 7 shows an ini- 
list file 206 have been assembled, the word content vector tial source text 700. After some preprocessing, the source 
learning process is performed. The Caid patent provides a text 700 is transformed into the preprocessed text 702. The 
full disclosure on this approach, but a summary of the key system creates a convolutional window 704 to analyze a 
steps are repeated here for convenience. 60 center stem and the center stem's neighbors. The window 
Referring to FIG. 3, the preprocessing is performed by a has a center and up to three words on either side of the 
preprocessing module 300 which is part of the word content center, thus providing seven total words in the window. As 
vector learning module 200. The preprocessing module 300 can be seen in FIG. 7, the center of the window 704 is at the 
reads the training corpus 202, determines if the current word front of the document. As such, the window is one-sided, 
is on the phrase list file 206 or the stop list file 204, performs 65 The correction vector for the window 704, centered on the 
the stemming of the word, and writes a resulting valid stem stem "mao", consists of a vector function of the initial 
302 to an intermediate stem file 304 (shown in FIG. 3). The condition content vectors for "rebel", "group", and the 
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phrase "Sendero Luminoso." The correction vector for a 
window 706, centered on stem "rebel", consists of a vector 
function of the initial condition content vectors for "mao", 
"group", "Sendero Luminoso*' and "bomb" The window is 
moved one stem at a time and this process is continued for 
each stem in the prcprcccsscd text 702. At each update 
position, a correction vector is calculated and summed into 
an accumulated correction vector located in the data items 
404 of the stem hash table 400 (shown in FIG. 4). 

FIG. 8 is a diagram illustrating a manipulation process for 
calculating a stem vector by the vectors of its neighboring 
stems. Each stem in the stem hash tabic 400 is modified by 
the vector of its neighboring stems. For each convolutional 
window, a vector 800 associated with the center stem is 
adapted by a vector 802, 804, and 806 which are associated 
with the stems of the center stem's closest neighbors. 

The following equations 1-4 provide the method for 
calculating the correction for each word in the convolutional 
window shown in FIG. 7. 



10 
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only two passes are required to achieve excellent learning 
results. The equations 1-4 are applied for each window 
position and the results are accumulated. At the end of each 
pass through all the stems in the intermediate stem file 304, 
the equations 5-9 are applied. The algorithm disclosed in the 
equations 5-9 computes the mean stem content vector, 
subtracts this from each normalized stem content vector and 
then re-normalizes the entire set of stem content vectors. 
This computation prevents the stem content vectors from 
being localized to only one region of the hypersphere and 
promotes better utilization of the vector space. 
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The value E, y represents the error or correction between the 
current target (center) Ty and neighbor N jy . A correction for 
T y taking into account all current neighbors in window 
position is shown by equations 2 and 3. 
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The variables shown in equations 2 and 3 are denned as 
follows: 

(i) i=summation dummy variable; 

(ii) C^^^^^^rrection vector for stem j; 

(iii) c^ cct/A/t/LA7Er> =running summed correction for 
stem j stored in stem hash table data elements; 

(iv) WS=window size in stems; and 

(v) a t y«a proximity constraint that limits the correcting 
effect according to the statistics of stem i and j where 

a-i-e.e,. 



0i t =B + {l-B) 1 



(a/pJ 



\tnd) 



Equation 4 is used to calculate the variable a^- for equa- 
tion 3. The variables shown in equation 4 are denned as 
follows: 

(i) 9,-=statistical importance of stem i in the training 
corpus; 

(ii) B=bounds control parameter determined empirically; 

(iii) ND ( =number of documents that contain stem i; and 

(iv) TND-total number of documents in the training 
corpus. 

Equations 5-9 included below provide a final update to 
the vectors. It should be noted that the algorithm described 
in the Caid patent utilizes a batched update process. As such, 
the algorithm accumulates corrections to the target (center) 
stem content vector. These corrections are made only at the 
end of a pass through the training corpus 202. Typically, 
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The variables used in the equations 5 through 9 are defined 
as follows: 

(i) T/^-oM target content vector (initial condition or 
previous pass); 

(ii) Ty^ ^updated target content vector for current pass; 

(iii) y=learning rate; 

(iv) Fy=total number of instances of stem j in the training 
corpus; 

(v) C/^^^^^-accumulated correction vector for 
stem i for this pass; 

(vi) M=mean stem vector; and 

(vii) NSTEMS-total number of unique stems in the trig 
corpus. 

3. Writes output file. 

Once the training is complete, the resulting stem content 
vectors are written to the word vector database 208. 



45 



FIG. 



Session Compiler Module 
9 is a block diagram illustrating the inputs and 



outputs of the entity vector creation process of the session 
compiler module 210. In the present invention, the session 
50 compiler module 210 performs three functions: text to entity 
vector transformations, file format conversions, and session 
control file generation. Each of these functions is described 
below. 

1. Text to entity vector transformations. 
To enable the selection of data or information, a text to 
entity vector representation transformation is required to 
generate the entity vectors. FIG. 9 illustrates the processing 
performed by the session compiler module 210 when it 
converts textual information into an entity vector. A prepro- 
cessing module 1100 reads the entity descriptions from the 
entity description database 216. The preprocessing module 
1100 is a sub-component of the session compiler module 
210 shown in FIG. 2. The preprocessing module 1100 
module is similar to the preprocessing performed as part of 
65 the preprocessing module 300 shown in FIG. 3. The pre- 
processing module 1100 determines if the current word read 
from the entity descriptions database 216 is on the stop list 
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file 204 or on the phrase list file 206. The valid stems 302 are of entities, such as advertisements, that are eligible for 
then used by an entity vector formation module 1102 to selection to become active. The session control file 215 
generate static entity vectors which are then stored in the contains a control mechanism for the following attributes for 
static content vector database 234. each ad: 

Equation 10 is an example of the process performed by 5 Static or dynamic attribute control. An ad is either statically 
the session compiler module 210 shown in FIG. 2 to form set to a fixed location in the context vector space or 
entity vectors. The content vector for an entity is the allowed to dynamically adapt based upon user responses, 
normalized weighted sum of the symbols that comprise that Reset. If reset functionality is specified, the entity vector for 
entity. For those entities that are textual descriptions, the an existing ad is reset to its initial vector, 
description vector is the normalized weighted sum of the to Targeted impression count. The targeted impression count 
stem content vectors of the stems in the description. determines the number of impressions that should be 

delivered for the ad. 

ftsTEMS; (10) $ tart date. The start date determines the date and time that 

tne ac * *s to become eligible for selection. 
a NDj V s 15 End date. The end date determines the date and time that the 

ad is to become ineligible for selection. 



ECV; = 



Tf'jTtW, 1 Initial conditions. TV advertiser can describe the initial 

/ i )o \NDjf CVi \ entity vector selection for the ad. Options for the initial 

J W vector selection include: 

20 (i) a vector based upon the ad and the aforementioned text 

The variables in equation 10 are defined: to vcctor conversion process; 

(i) EC V-the entity content vector for entity i; ( u ) a vector based u P° n me entit y vector of a specified ad; 

(ii) SCV^-tbe stem content vector for stem j; ^ 

(iii) TND-the total number of documents in the training 25 (iii) a vector based upon an entity vector contained in an 
corpus; external file. 

(iv) ND-the number of documents in the training corpus ^ s^ 00 compiler module 210 also establishes a 
that contain stem j- and connection with the EVUS module 228 (shown in FIG. 2) 

(v) NSTEMS-the number of stems in the description of which fc the «P«itoiy for all active entity information^ The 
ent'tv ' 30 uutia * en tity vectors are calculated for the ads and the ad 

2. File format conversions. identifiers, parameters, and vectors are transmitted to the 
FIG. 10a is a block diagram illustrating the process for ™ module 225 for stora S e fa tbe ^ veclor database 

creating the high speed files 232 shown in FIG. 2. The 

session compiler module 210 performs the file format con- „ AJO wji 

* r -i * . *u i .u Real Time Ad Server Module 

versions to facilitate the entity selection process, since the 35 

data structure formats that are computationally efficient The RTAS module 214 provides an Application Program- 
during word content vector learning are inappropriate for ming for Interface (API) local or remote software appli ca- 
use by the RTAS module 214 during entity selection. The tions call to perform various ad management routines. The 
operational flexibility and complexity required for the word RTAS module 214 selects according to guidelines set in the 
content vector learning process is not required by the RTAS 40 session control file 215 an advertisement to be displayed to 
module 214, so substantial simplifications are made. As part a user. 

of this transformation, all of the stem content vectors are During the ad selection processing, the RTAS module 214 

pre -scaled by their frequency of occurrence in the training performs the processing steps listed below. 

corpus 202. The stem content vectors are scaled by the same . . ... v 4 . 

- r , 1. Initialization, 

factor shown in equation 10 without the final normalization 45 

step. Additionally, the low frequency stems (those that occur When *e RTAS module 214 receives an initialization 
less than four times in the training corpus) arc removed from command, RT ^S module 214 reads the session control 
the file. The remaining stems represent the statistically most mc 215 ™ d initializes internal control tables stored in 
significant portion of the overall stem population. The memory. The RTAS module 214 then requests the appro- 
session compiler module 210 then writes the resulting 50 P riate cntit y vcct0IS ^ specified by the contents of 
tnincated and pre-scaled stem content vectors to the high me t* 55 ™ me 215 ^ ^AS module 214 ^quests 
speed files 232 from the entity vector database 230 via the EVUS module 

3. Session control file generation. 228 entit V vectors for active entities stores them 
FIG. 10b is a block diagram illustrating tbe process for m a memory resident data structure within the RTAS module 

generating the session control file 215. The primary function 55 214 0nce initialization is complete, the RTAS module 214 

of the session compiler module 210 is the generation of a * read y for real-time operation, 

session control file 215 from operator created session 2. Determination of user behavior vector, 

descriptions contained in the session description file 212. When a user action message enters the system via the 

The session control file 215 contains the instructions in client API, the RTAS module 214 establishes the behavior 

machine readable format to guide the RTAS module 214 60 vector for that user. If the user is already known to the 

during the ad selection process. The session control file 215 system, the user id, provided by the user's web browser, is 

contains the information specified in the session description used as the key for a database retrieval The RTAS module 

file 212 and a set of initial entity vectors. The session control 214 passes the user ID to the PVUS module 224. The PVUS 

file 215 has the ability to identify a group of ads eligible for module 224 then returns the profile vector associated with 

selection and to provide a set of backup entity vectors in 65 that user from the profile vector database 226. If the user is 

event the EVUS module 228 is not operational. The session unknown to the system, a new initialized vector is returned, 

information in the session description file 212 describes a set In either case, the RTAS module 214 has a valid user 
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behavior vector for subsequent operations. This vector is At a state 1602, the RTAS module 214 selects the closest 

stored within internal memory tables within the RTAS ad subject to inventory management constraints, 

module 214. Specifically, if the closest ad has already been presented 

3. Observation of user behavior. sufficient times to the user population, the RTAS module 214 

The RTAS module 214 also incorporates user behavior 5 P rcsents thc second closest matching ad is selected. The 

into the user's profile vector. These behaviors include a web inventory management scheme can be arbitrarily complex 

page request 218 or a search query 217 as shown in FIG. 2. and mc RTAS module 214 has provisions to interface to an 

The client computer 219 passes a behavior packet including external inventory management module, that may be used to 

the observed behavior in the client API. The RTAS module adoptively balance ad presentations subject to a series of 

214 uses the information contained in the user behavior 10 external constraints. An inventory weighting module which 

packet to generate a behavior vector for the user action. fe a sub-component of the RTAS module 214 performs all of 

FIG. 11 is a block diagram illustrating the inputs and * e mventor y ^^J* vector 

outputs of the process for transforming a page request into clo f ene f we ? h ^ T ^ 7°^ ill ^ 

an content vector which is performed by the RTAS module and ^ each ad W h * «** < statc 1604 >' 

214 shown in FIG. 2. If the observed user behavior includes 15 ° nce the closest entity or ad has been selected, subject to 

a page, navigation or hierarchy request 218, a page content *he constraints described above, the RTAS module 214 

vector lookup module 1402 which is part of the RTAS V* 31 ** the ID of me selected ad via the API to the client 

module 214 retrieves the entity vector associated with the ID computer 219. The ID is used by the client computer 219 to 

of the requested page from the static content vector database retrieve and display the actual ad image to the user via the 

234. The RTAS module 214 augments the entity vector 20 web 

module of the user requested page with the user ID to To improve performance, the RTAS module 214 caches 

provide a behavior vector. the set of entity vectors and parameters described by the 

FIG. 12 is a block diagram illustrating the inputs and aclive session control file 215. The RTAS module 214 

outputs of the process for transforming a query into a content ^ obtains periodic updates from the EVUS module 228 for 

vector which is performed by the real-time ad server module changed entities. 

shown in FIG. 2. If a client computer 219 observes user 5. Observe user response to entity presentation, 

behavior such as a user query 217, the RTAS module 214 Once the best entity or ad is determined and the host 

transforms the query 217 into a content vector by using the system generates a web page that incorporates the selected 

memory image of the highspeed file 232 to provide an entity ^ a d, the user response to the ad is monitored. If the user 

vector based upon the query terms. A preprocessing module requests more information on the product, as indicated by 

1502 which is part of the RTAS module 214, removes clicking on the entity or ad, this event is sent via the API to 

common words that are identified in the stop list file 204, me RTAS module 214. However, all ad presentations need 

combines the words that are part of a phrase list file 206 and not result in a message to the RTAS 214 module, 

performs the stemming process of the terms in the 35 6 of user vectQr 

user query 217. The RTAS module 214 then creates a user ^ t t „ A t 

behavior vector by augmenting the result of the aforemen- ^ RTA f module 214 ada P te me ^havicr profile 

tioned query 217 with the user ID by a form entity content vector to reflect observed user actions. The observed user 

vector module 1504 behavior vector is then used to update the profile vector for 

4 Selection of best entit to resent ^ at USCT ' ^ e **TAS module 214 sends the user ID and 

„ . . . , .. .„ • . r. r, 40 observed user behavior vector to the PVXJS module 224 for 

FIG. 13 is a block diagram illustrating the functrcns of the modificatioQ ofthe h^ ori cal user profile vector. Hie opera- 
ad selection process performed by the RTAS module 214 tion of ^ pvus modu , c 224 is described below in greater 
shown in FIG. 2. If the client computer 219 (shown in FIG. detail 
2) does not observe user behavior, the RTAS module 214 

uses historical user profile vector from the profile vector 45 7 * of ad or enUtv vector * 

database 226 as the basis for selection of the best entity to If toe user responds to the entity (ad, coupon or 

present If the client computer 219 observes interactive solicitation), the entity vector and user ID are sent to the 

behavior, the RTAS module 214 uses the observed user EVUS module 228 for update (shown in FIG. 17). The 

behavior vector as the basis for selecting the best entity to EVUS module 228 then adapts the entity vector toward the 

present 50 observed behavior vector of the user. The operation of the 

The RTAS module 214 uses the historical user profile Evus module 228 is described below in greater detail, 

vector or behavior vector as the basis of a vector comparison „. , , n 

against all active ads as specified by the session control file Vcctor U P dale Scrvcr Modulc 

215. The contents of the session control files 215, which arc The PVUS module 224 is responsible for the storage, 

loaded into memory by the RTAS module 214 during 55 retrieval and updating of the user profile vectors that are 

initialization, are available for high speed searches. By contained in the profile vector database 224. 

placing the session control files in memory of the RTAS ^ USCf file vcctQrs arc ^ basis for a ^ 

module 214, a performance gam is realized. of capabilities mchlding: 

Referring again to FIG. 13, the RTAS module 214 per- ' . . - t , . * . „ 

forms the optimum entity selection by forming dot plod- 60 ® *«min««ii of targeted advertising, coupons, offers 

ucts of entity vectors with behavior and user vectors (state and 10 *e user, 

1600). The RTAS module 214 performs a search process by ( u ) analysis of user behavior; 

computing a vector closeness metric between the vector to (iii) correlation of user behavior with advertising, product 

be compared and all active ad vectors. Although the RTAS and service descriptions; 

module 214 may use any vector proximity metric, the 65 (iv) inference of user characteristics based on cross- 
preferred embodiment of the invention uses a vector inner correlation of user behavior profiles with demographic 
product. purchasing databases; 
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(v) tracking user behavior over time; FIG. 14 is a block diagram illustrating the profile vector 

(vi) predictive modeling of likely user behavior and adaptation process of the PVUS module 224 shown in FIG. 
optimization of information presentation based on 2. The PVUS module 224 contains several subcomponents 
these predictive models; and including a transform query to content vector (TQCV) 

(vii) accumulation of effectiveness and click-through sta- 5 module 1700, a transform action to content vector (TACV) 
tistics to optimize predictive models, provide manage- module 1704, a profile vector lookup (PVL) module 1706, 
ment reports and lay groundwork for increased ad and a profile vector adaptation (PVA) module 1702. If the 
revenues. user enters a query 217 to the system, the query string and 

In the preferred embodiment of the invention, the PVUS the user ID are passed to a TQCV module 1700. The TQCV 

module 224 also incorporates external information, such as 10 module 1700 transforms the query string into a query 

credit card purchasing histories into the user profiles. This ^ntem vector ^ the manner as the process shown in 

information is easily incorporated, since the content vector mQ Wb The resulling query content vector is passed to the 

representation accommodates the aggregation lof heteroge- pyA 1?Q2 fa ad lalion of lhe ^ file vecton 

neous symbolic information, such as ^web behavior and Tfae pVA module 1702 mc adaptaU ^ n algorithm 

credit card purchases, into a unified high dimensional space. „ , , , . - n r , <n , A ° CI 

Tlie PVUS module 224 is implemented as one or more 15 described below and shown m FIGS. 19a and 19*. Aprofile 

executable computer programs that receive requests from vec £ r databasc m ° dulc 17D j ^ tcd 

clients over a network protocol such as TCP/IP and gener- P rofile vcctor 15 10 me P rofilc vector databasc 226 for later 

ates responses to such requests. The PVUS module 224 has use * 

one or more physical databases which are used to store the If useT cucks on m ad or purchases a product 222, the 

user profile content vectors and parameters which are 20 TACV module 1704 uses the ad or product description to 

indexed by unique user IDs. Multiple PVUS modules may adapt the user profile vector. This module 1704 transforms 

be in simultaneous operation to handle large numbers of the entity ID into a content vector via a table lookup in 

users. When multiple servers are used, they are addressed by memory within the PVUS module 224. In parallel, the user 

the clients as a single entity. The PVUS modules may be ID is used to lookup the user profile vector from the profile 

scaled across multiple hardware platforms to support a large 25 vector database 226. The PVA module 1702 inputs these two 

user population. This distribution is accomplished by mul- vectors into the adaptation algorithm described in FIGS. 19a 

tiplexing the user ID to select one of N physical servers. and 19b. The profile vector database update module 1704 

Each server may further multiplex the request to one of M then writes updated profile vector to the profile vector 

physical databases. Multiplexing is achieved by generating database 226 for later use. 

a hash code from the user ID which results in essentially 30 4. Demographic predictions and tagging, 

random bit distributions in the hash code. There are many For many advertising and sales applications, it is advan- 

multiplexing hashing algorithms, that may be used. tageous to have an estimate of the demographic parameters 

However, the preferred embodiment uses the MD5 algo- for individuals as well as the entire user population. In the 

rithm. The multiplexor then uses a modulo N on the hash present system, although no user registration is performed, 

code to determine which of N children should handle the 35 advertisers can analyze the profiles vectors to retrieve user 

request demographics. Advertisers can use these profile vectors to 

The PVUS module 224 performs the processing actions estimate the age, gender, household income (HHI), and other 

listed below. demographic parameters of a user. The present system has 

1. Creation of new user profile vectors. provisions to incorporate demographic models into its 
Each user in the system is assigned a unique user ID. In 40 operation. When using these demographic models, the 

the web environment this could be the user's Internet IP demographic parameters of the entire population of users are 
address or a host selected ID placed in the browser "cookie" estimated and used to augment the information in the profile 
file. If a request is made for a user ID that is not contained vector database 226. This estimate process can be performed 
in the profile vector database 226, the PVUS module 224 by both conventional statistical methods or via neural net- 
creates an entry for the new user and initializes the new user 45 work techniques. 

profile vector to a pseudorandom content vector which will FIG. 15 is a block diagram illustrating the inputs and the 

be later adjusted by the adaptation algorithm (shown in outputs of the model building process performed by the 

FIGS. 19a and 196). relationship, discovery and analysis module shown in FIG. 

2. Retrieval of user profile vectors and parameters. 2. Predictive models for the estimation of age, gender and 
The PVUS module 224 obtains a user profile vector by 50 HHI are constructed using a combination of previous 

using the unique user ID to determine a hash code. The entry observed behavior contained in a historical user actions 
in the hash table at that location contains a pointer to the database 226, ground truth information from user registra- 
information in the profile vector database 226. The PVUS tion on a different service or via a ground truth user 
module 224 uses the database pointer to perform the demographics database 1904, and profile vectors from the 
retrieval of the user profile vector. If the request for the 55 profile vector database 226. Using this information, a mod- 
profile vector is from the RTAS module 214, the PVUS cling module 1900 constructs a set of model parameters, 
module 226 passes the profile vector to the RTAS module These model parameters are stored in a model parameter 
214 for subsequent use. database 1906. Any statistically valid process can be used in 

3. Adaptation of user profile vectors. the modeling module 1900, including neural network 
The PVUS module 224 performs the adaptation of the 60 techniques, to derive the predictive model parameters. The 

user profile vectors based upon some observation about a typical outputs of the predictive model are age, gender and 

user's behavior. As the user traverses a web site utilizing the HHI. 

present invention, behaviors are observed including user FIG. 16 is a block diagram illustrating the process by 

requested queries, web pages and ads. Each observed be hav- which the PVUS module 224 updates the profile vector 

ior is converted into a content vector and is also used to 65 database 226 with demographic information. For each user 

adapt a user's profile vector which is stored in the profile in the profile vector database, the PVL module 1706 

vector database 226. retrieves the user's associated profile vector from the profile 
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vector database 226. The PVL module 1706 passes this If the user requests a query 217, the TQCV module 2102 

information, with the user ID and historical actions to a transforms the actions into a query content vector, and 

predictive model processing module 2000. The predictive passes the result to the entity vector adaptation module 2100. 

model processing module 2000 uses the model parameters If the user event was a clickthrough or purchase 222, the 

from a model parameter database 1906 in conjunction with 5 entity vector lookup module 2104 retrieves the entity vector 

the current user profile vector and previous observed behav- for the selected entity from the entity vector database 230. 

ior to form an estimate of a set of demographic parameters The entity vector lookup module 2104 passes the vector for 

for the current user. This estimate is added to the information the selected entity to the entity vector adaptation module 

contained in the profile vector database 226 by a profile 2100. If the user ID was passed into the EVUS module 228, 

vector update module 1704. The augmented contents are to the profile vector lookup module 2106 retrieves the profile 

used as input to the RTAS module 214 during ad selection vector for the user from the profile vector database 226. The 

and as part of the relationship discovery and analysis pro- resulting content vector is then passed to the entity vector 

cess. adaptation module 2100. 

_ . __ . _ , , After receiving the inputs from the TQCV module 2102, 

Entity Vector Update Server Module 15 me entity vector lookup module 2 i04, and the profile vector 

The EVUS module 228 is responsible for the storage, lookup module 2106, the entity vector adaptation module 

retrieval and updating of the entity profile vectors from the 2100 performs the equations shown in FIGS. 19A and 19B. 

entity vector database 230. The EVUS module 228 is the ^ entix y vector adaptation module 2100 then passes the 

main processing module in the system to read and to write updated entity vector along with the entity ID and time 

the entity vector database 230. 20 stamp to the entity vector database update module 2108 

The EVUS module 228 consists of one or more execut- ?* crc mc ^nnation is stored in the entity vector database 
able computer programs that receive requests from clients 

over a network protocol (TCP/IP) and generates responses to 4 - Time updates. 

such requests. The EVUS module 228 may have one or more ^ The EVUS module 228 generates a list of entity IDs 

physical databases which are used to efficiently store and whose vectors or parameters that have changed since a given 

index the user profile content vectors and parameters by moment in time. This, information is used to refresh the 

unique entity IDs. memory image of the EVUS module 228 that is maintained 

During normal operation, the EVUS module 228 per- bv mc RTAS module 214 - When me rcfresh command 

forms the following processine. ™ ^^ag a stamp of the last update is sent from RTAS 

1 c . •* • „• A a i module 214, the EVUS module 228 sends to the RTAS 

1. Entity insertion and deletion. , , . A c . 

module 214 the entity IDs, vectors parameters for those 

The EVUS module 228 inserts new entities or ads and iterns lhat have changed and a new time stamp. Ibis 

removes old entities or ads from the entity vector database information is then saved in the RTAS internal memory for 

230 m response to commands from the session compiler use in the entity selection process, 

module 210. When a new session is compiled, the session 35 ^ Inventory management 

compiler module 210 sends update commands, new entity * __ _ , , „ _ * , 

vectors and parameters to the EVUS module 228. The EVUS module 228 performs inventory management 

Additionally, the session compiler module 210 sends com- weighting across all the ads in the system. The EVUS 

mands to toe EVUS module 228 to delete inactive entities m ? duk 228 ^ a weight parameter to adjust each of the ad 

from the entity vector database 230. 40 selection scores. The weight parameter is determined by the 

_ _ « . . , ads targeted number of impressions and the time remaining 

2. Store and retrieve entity vectors and parameters. f QT eac jj ac j 

The EVUS module 228 stores and retrieves the current 

parameters and vectors for all the active entities in the Profile and Entity Adaptation Algorithms 

system and retrieves the entity vector and associated param- 45 fig. 18 is a flowchart illustrating the vector adaptation 

eters for a given entity ID. During the initialization of the process of the profile vector update server and the entity 

RTAS module 214, the RTAS module 214 sends a command vector udpdate server upon the receipt of a user query 217 

to the EVUS module 228 requesting a copy of the most (shown in FIG. 2). Both the PVUS module 224 and the 

recent entity vectors and parameters to hold in the memory EV US module 228 apply a set of adaptation algorithms to 

of the RTAS module 214. jo a jjQ W me cvo lution of profile vectors and entity vectors as a 

3. Entity profile vector adaptation. result of user interaction with the system. 

FIG. 17 is a block diagram illustrating the entity vector Starting at a state 2250, the PVUS module 224 updates the 

adaptation process of the EVUS module 228 shown in FIG. profile vector based upon the current user query 217. Mov- 

2. The EVUS module 228 performs the entity vector adap- ing to a decision state 2252, the profile vector of the user and 

tation in response to a message indicating that a user has 55 the behavior vector for the search query is compared to the 

responded to (clicked on) an entity. The EVUS module 228 entity vectors and ranked according to this comparison to 

includes various sub-components including an entity vector establish a score. The profile score of the user is compared 

adaptation module 2100, a transform query to content vector against the behavior score for the search query to determine 

(TQCV) module 2102, an entity vector lookup module 2104, which score is larger. If the query score is larger, the PVUS 

a profile vector lookup module 2106 and an entity vector go module 224 proceeds to update the bias vectors (slate 2254). 

database update module 2108. The bias vectors are used to make sure that during the 

An adaptation event occurs when a user enters a query continual mutual adaptation process that the vectors do not 

217 or clicks on an ad or performs an on-line purchase of a all end up with the same entity vector. Next, at a state 2258 

product 222. When the entity adaptation event occurs, the an advertisement is selected based upon the query score, 

user action that resulted in the event, such as a user query 65 Proceeding to a decision state 2260, the system checks 

217, the EVUS module 228 obtains from the RTAS module whether a user clicked on the provided advertisement. If the 

214 the entity ID for the selected ad and the user ID. user did not select the advertisement the system returns to 
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the state 2250 to repeat the adaptation process upon future updating the ad vector with a user query after the user 

user queries. If the user selected an advertisement, the selected an advertisement. Similarly, the equations in block 

EVUS module 228 updates the selected ad vector with the 2280 are performed at the state 2264 during the update of the 

behavior vector for the user query 217 (state 2262). Next, at profile vector with the selected advertisement 

a state 2264, the PVUS module 224 updates the profile 5 The equations in block 2282 (shown in FIG. 19a) are 

vector with the entity vector for the advertisement and performed at the state 2254 (shown in FIG. 18) to update the 

moves to a state 2274. bias vectors to insure that all of the vectors do not eod up 

Referring again to the decision state 2252, if the profile with the same multidimensional vector. The equations in a 

score is greater than the query score, the bias vectors are block 2286 are performed at the state 2272 during the update 

updated in state 2265. Next, at a state 2266, the ad selected 10 an aQl vector with the profile vector. These equations are 

by the profile scoring is sent to the client 219. Proceeding to performed when the profile score of the user is greater than 

a decision state 2268, the system checks whether a user me °, ue ry score- Similarly, the equations in a block 2288 

clicked on the provided advertisement. If the user did not (shown in FIG. 19b) are performed in the state 2270 (shown 

select the advertisement, the system returns to the state 2250 ^ F* 0 - 18 ) during the update of the profile vector with the 

to repeat the adaptation process upon the receipt of future 15 a d vector. 

user queries 217. If the user selected an advertisement, the T° c equations in a block 2290 (shown in FIG. 19b) are 

PVUS module 224 updates the profile vector with the entity performed during the update of the bias vectors after the 

vector for the advertisement (state 2270). The EVUS module profile score of the user is greater than the score for the users 

228 then updates the selected ad vector with the profile <^ery at the state 2265 (shown in FIG. 18). 

vector for the user query 217. 20 

w . * Calculation of Entity Inventory Weighting (Cost 

Moving from a state 2264 or a state 2272, at a state 2274 Model) 

the cost model associated with the selected advertisement is 

updated. The system then returns to the state 2250 to repeat The RTAS module 214 keeps track of the number of 

the adaptation process upon the receipt of future user queries impressions generated for each entity or ad and periodically 

217. 25 sends a message to the EVUS module 228 containing each 

The profiles of individual users and the corresponding ad ID and me number of impressions generated for each ad. 

current query are represented by the following two sets The RTAS module 214 uses the ad parameters speafying the 

shown in equations 11 and 12. targeted number of impressions and the time each ad will 

remain eligible for serving to calculate inventory manage- 

30 ment weights. The weights are applied by the RTAS module 

P.irf-lpi^ • - - » P/"" 01 (11) 214 when scoring the active ads. The RTAS module 214 

causes under-impressed ads to receive more impressions 

Q«jd~{qi , • ■ • > q* ( 32 ) while suppressing impressions for over-impressed ads. 

User queries includes topical information which may be 35 Relationship Discovery and Analysis Processing 
represented by more than one content vector. Equation 13 Module 
shows that each ad is represented in a similar way to the 

profile vectors. The present system has the ability to perform relationship 

discovery and analysis on the contents of the profile vector 
database 226 and the entity vector database 230 (shown in 
Aw-Hai***" 0 - • ■ a/ - ** 0 } (13) 40 FIG. 2). The results of the analysis process are listed in a 

series of reports that are presented to an analyst 240. These 
Equations 14 and 15 are applied during the decision state reports provide high value actionable information about the 
2252 shown in FIG. 15 when computing the scores. behavior pattern of the system users. 

The RDA module 236 takes the contents of the profile 

P _score</^, A\ ^,) = [p.- -of^Vaiuw] < 14 > vector database 226 and the entity vector database 230 to 

produce the analysis reports 238 (shown in FIG. 2). A variety 

9 _scorc(a^, A i a«)= (15) of conventional statistical techniques or neural network 

.max fl^vatue^ x i (q- t - af** > threshold^)] approaches are utilized to perform this analysis. 

50 The RDA module 266 performs three analytical functions 
including unsupervised categorization, supervised categori- 
The Value parameter is computed by traditional cost zation and conventional statistics. Each of mese functions is 
model algorithms (not shown). described below. 

FIGS. 19a and 19b are the equation flow diagrams for the 1. Unsupervised categorization, 
fixed rate learning algorithm performed by the profile vec- 55 The RDA module 236 performs unsupervised categorize - 
tors update module and the entity vector update module don or clustering of the contents of the profile vector 
shown in FIG. 2. Unless otherwise noted, the adaptation database 225, the entity vector database 230 or both of these 
equations shown in FIGS. 19a and 19b are common to both modules. The RDA module 236 may use any clustering 
profile and entity content vector updates. The parameters algorithm as long as the algorithm can process real number 
used to control the adaptation process are shown in FIG. 20. 60 vectors. The result of the clustering is a set of cluster 
The following is a description of when the equations centroid vectors that represent regions of high density. These 
illustrated in FIGS. 19a and 19b are performed in relation to centroids are the same type of vectors that are used for all the 
the states illustrated in FIG. 18. The equations in a block other processing operations. Thus, the RDA system is able 
2276, 2284 (shown in FIG. 19a) are performed during the to find a set of word vectors that are closest in the content 
state 2250 (shown in FIG. 18) during the process of updating 65 vector space to the centroid vector. The words associated 
the profile vector with a query. The equations in a block with these vectors, and their closeness metrics, constitute the 
2278 are performed at the state 2262 during the process of key concepts of the centroid. The evolution of cluster 
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centroids over time provides trend information. The identi- 
fication of word vectors close to these cluster centroids 
provides a key word summary for each cluster, easing the 
analysis and interpretation task. Conventional statistical 
analysis may also be used to investigate cluster contents. 5 
The variance of cluster membership is a reliable indication 
of cluster coherence and "unity of focus" in terms of 
behavior. The cluster identified in the product vector data- 
base 226 provides highly valuable information about the 
characteristics of the users who purchase the products. This 10 
result is a natural consequence of the mutual adaptation 
between the ad vectors and the user content vectors. The 
number of clusters found and the explanation of the clusters 
are listed in the analysis reports 238 for inspection by the 
analyst 240. 15 

2. Supervised categorization. 

The RDA module 236 also performs supervised catego- 
rization of the contents of the profile vector database 226, 
the entity vector database 230, or both of these modules. For 
supervised categorization, the RDA module 236 may 20 
employ any algorithm that can process real number vectors. 
The result of the categorization is a set of users and/or 
entities that are members of each category. The count and 
membership list, by category, are listed in the analysis 
reports 238 for inspection by the analyst 240. 25 

3. Conventional statistics. 

The RDA module 236 applies conventional statistics to 
the contents of the profile vector database 226, the entity 
vector database 230, or both of these modules. This analysis 30 
produces counts, count distributions, means and variances of 
the database contents as a whole or any subset which is 
selected by the analyst 240. The results of the processing are 
listed in the analysis reports 238 for inspection by the analyst 
240. 3S 

A summary of the operation of the system for the adver- 
tising scenario is as follows. 
System Initialization 

First, the word content vector learning module 200 creates 
multiple content vectors from text input. This technique, 40 
results in a set of word content vectors, one word content 
vector for each word found in the training vocabulary and 
not in the stop list 204. In the advertising scenario, the 
training texts are usually product descriptions and trade 
magazines. For instance, if the targeted domain of operation 45 
is sporting goods, the training corpus would contain the 
product descriptions from sporting goods catalogs, the con- 
tents of several years of "Field and Stream" magazine and 
books on outdoor life. The selection of the correct training 
corpus, oriented toward the domain of operation, such as 50 
outdoor sports, gives the correct "spin" on the usage of 
words. In the aforementioned example, "bow" becomes 
associated with "arrow" as opposed to "violin" or an action 
performed on stage at the completion of a performance. 

Next, the session compiler module 210 computes an 55 
initial set of entity vectors from descriptions of the products 
for sale. These descriptions might come from the product 
catalog or be typed in by a human operator. For instance, 
"Warm Goretex, down rilled parka perfect for those cold 
winter nights and days on the ski slopes" might serve as the 60 
text initialization for the entity vector for a coat The entity 
vector is a normalized weighted sum of the word vectors of 
the words in the description. The session compiler module 
210 creates an initial entity vector for each ad in the system. 

The session description file 212 describes the guidelines 65 
for ad placement, such as display times authorized, sites, 
which ads to show and how many impressions are available, 



is compiled into a machine usable form. This results in the 
creation of a session control file 215 with information that 
controls the operation of the ad serving process. Finally, a 
plurality of user profile vectors are initialized to NULL. 
Regular System Operation 

During normal operation, an Internet user visits the web 
site and the user's identity is established via login, or 
through the web browser cookie. Each user identity is 
associated with a profile vector. If the user is new, a null 
profile vector is created and added to the profile vector 
database. If the user is known to the system, the user's 
profile vector is retrieved from the profile vector database. 

The system then observes the user's behavior. If a user 
performs a search query 217, the user profile vector is 
updated using the content vector of the current query. If the 
user requests a web page to read 218, the user profile vector 
is modified by the content vector of the requested web page. 

The updated profile vector is then used as the basis for 
selecting relevant ads to display to the user. Relevance is 
determined by closeness of the user profile vector to entity 
vectors stored in the ad vector database. The closest entity 
vector is selected as being most relevant. 

If the user is presented an ad, and the user requests more 
information of the subject of the ad by clicking on the ad, 
both the current ad entity vector and the profile vector are 
updated. The user profile vector is moved or adapted a small 
step in the direction of the entity vector. The entity vector is 
moved or adapted a small step in the direction of the profile 
vector of the person who clicked on the ad. Thus, the system 
allows ads to assume the characteristics of the users that 
purchase them in a real-time adaptive manner. 
Discovery and Analysis Mode 

During discovery and analysis mode, at periodic intervals, 
the system administrator performs relationship discovery 
and analysis of the contents of both the entity vector 
database 230 and the user profile vector database 226 to 
discover useful and exploitable characteristics of user 
behavior. The RDA module 236 performs vector clustering, 
reporting and provides summary statistical information of 
system effectiveness in terms of user clickthrougb rate. The 
behavior clusters discovered during analysis serves as the 
basis of advertising campaigns and provides valuable insight 
into user behavioral preferences. The behavior clusters can 
be utilized as the initial conditions for entity vectors the 
advertiser wants to target by group, rather than on an 
individual basis. 

The system shown in FIG. 2 has two sets of adaptive 
components. First are the user profile vectors. Second are the 
content vectors for each of the advertisements. Both sets of 
vectors influence each other based upon the actions of the 
users. Ads are updated in such a manner as to adapt toward 
regions of high user interest, thus improving system effec- 
tiveness and automatically discovering group preferences. 
User profile vectors adapt based on the observed actions of 
the user, thus providing an accurate and timely representa- 
tion of user preferences both individually and as a group. 

Since the sets of vectors are dependent on a user's actions 
and evolve with time, they encode a wealth of information 
about the behavior of the user population. These content 
vector sets can be "mined" with the analysis tools to provide 
valuable actionable insight concerning both individual, 
group and whole population behavior. 

While the above detailed description has shown, 
described, and pointed out the fundamental novel features of 
the invention as applied to various embodiments, it will be 
understood that various omissions and substitutions and 
changes in the form and details of the system illustrated by 
those skilled in the art, without departing from the intent of 
the invention. 
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What is claimed is: so as to identify at least one entity vector closely 

1. A computerized system for associating an observed associated with the observed behavior, and 
behavior with items, comprising: a selector accessing the electronic database with the 

a converter capable of converting the observed behavior identified entity vector so as to select at least one 

to a behavior vector 5 electronic advertisement to communicate to the user 

- * computing device, 

a profile adapter capable of modifying a profile vector 10 Xhe systcm ^ dcfincd m claim 9 tbc sc]ecmt 

with the behavior vector; and includes inventory management to allow selection of an 
a comparater capable of comparing the modified profile entity that is under-selected according to the selection sched- 
vector to a plurality of entity vectors, the entity vectors 1Q ule and to inhibit the selection of an entity that is over- 
indicative of the items, so as to identify at least one selected according to the selection schedule, 
entity vector closely associated with the observed 11. The system as defined in claim 9, wherein the selector 
behavior. includes inventory management to allow selection of the 

2. The computerized system as defined in claim 1, entity vector based upon a presentation delivery schedule, 
wherein the observed behavior is selected from the group J5 12. The system as defined in claim 9, wherein the 
consisting of: submitting a query to a web site, requesting a observed behavior is selected from the group consisting of: 
web page, purchasing a product, visiting a merchant, visiting submitting a query to a web site, requesting a web page, 
a web site, inquiring about a product, watching a TV show, purchasing a product, visiting a merchant, visiting a web 
and watching a movie. site, inquiring about a product, watching a TV show, and 

3. The computerized system as defined in claim 1, 2Q watching a movie. 

wherein at least one of the items is selected from the group 13. The system as defined in claim 9, wherein the corn- 
consisting of: a coupon, an advertisement, a solicitation, puter environment is connected to a network, the network 
information relating to a product, information relating to a selected from the group consisting of: an intranet, a local 
set of services, a page, section or chapter of a book, a area network, and the Internet. 

document, a newspaper article, a movie, a TV show, a web 25 14. The system as defined in claim 9, wherein the con- 
site, and a textual material. verter includes a page content vector lookup module which 

4. The computerized system as defined in claim 1, identifies the behavior vector based upon a page identifier, 
wherein the computerized system is connected to a network, 15. The system as defined in claim 9, wherein the 
the network selected from the group consisting of: an observed behavior is selected from the group consisting of: 
intranet, a local area network, and the Internet. 3Q a user query, a page view, or a purchase of a product. 

5. The computerized system as defined in claim 1, 16. The system as defined in claim 9, wherein the profile 
wherein the converter includes a page content vector lookup adapter modifies the profile vector based upon at least one 
module which identifies the behavior vector based upon a parameter selected from the group consisting of: a threshold 
page identifier. by which a behavior vector will be used instead of the entity 

6. The computerized system as defined in claim 1, 3S vector, a learning rate for a profile update, a leaning rate for 
wherein the observed behavior comprises a user query, and an entity update, an update rate for a query universe estimate 
wherein the converter includes an entity content vector of a mean, an update for a profile universe update of a mean, 
module for trarisforrning the user query into behavior vec- a forgetting factor for a set of profile vectors, a forgetting 
tors based upon the component words of the user query. factor for a set of entity vectors, a mean of a set of entity 

7. The computerized system as defined in claim 1, ^ vectors, a mean of a set profile vectors, and a mean of a set 
wherein the profile adapter modifies the profile vector based of behavior vectors. 

upon at least one parameter selected from the group con- 17. The system as defined in claim 9, wherein the com- 

sisting of: a threshold by which a behavior vector will be parater includes a vector closeness determination module to 

used instead of the entity vector, a learning rate for a profile calculate the distance between the modified profile vector 

update, a leaning rate for an entity update, an update rate for 4S and any one of the entity vectors. 

a query universe estimate of a mean, an update for a profile 18. A computerized system for adapting an entity vector, 

universe update of a mean, a forgetting factor for a set of comprising: 

profile vectors, a forgetting factor for a set of entity vectors, a converter capable of converting an observed behavior of 

a mean of a set of entity vectors, a mean of a set profile a use,- mto a behavior vector 

vectors, and a mean of a set of behavior vectors. a profile ad ter We of modifying a fiIe vector 

8. me computerized system as defined in claim 1, indicative of the user based on the behavior vector and 
wherein the comparater includes a vector closeness deter- . . , , r .... 

. t . « i . i * . «. , . an entity adapter capable of modifying an entity vector 

mination module to calculate the distance between the - ? ^ . , i_ £ , 

,. c , 4 , ^ indicative of an item based on the profile vector or the 

modified profile vector and any one of the entity vectors. behavior vector 

9. A system for selecting advertisements in a computer ee 10 10 , 

' ■ ■ , K S5 19. The system as defined in claim 18, wherein at least 

environmen , comprising: one of the items is selected from the group consisting of: a 

a database of electronic advertisements; and coupon, an advertisement, a solicitation, information relat- 

an electronic advertisement management system, com- ing to a product, information relating to a set of services, a 

prising: page, a section or a chapter of a book, a document, a 

a converter capable of converting an observed behavior 60 newspaper article, a movie, a TV show, a web site, and a 

of a user computing device in the computer eovi- textual material. 

ronment to a behavior vector, 20. The system as defined in claim 18, wherein the 

a profile adapter capable of modifying a profile vector observed behavior is selected from the group consisting of: 

indicative of the user with the behavior vector, submitting a query to a web site, requesting a web page, 

a comparater capable of comparing the modified profile 65 purchasing, a product, visiting a merchant, visiting a web 

vector to a plurality of entity vectors, the entity site, inquiring about a product, watching a TV show, and 

vectors indicative of the electronic advertisements, watching a movie. 
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21. The system as defined in claim 18, wherein the a converter capable of converting an observed behavior of 
computerized system is connected to a network, the network a user into a behavior vector; and 

selected from the group consisting of: an intranet, a local an entity adapter capable of modifying an entity vector 

area network, and the Internet. indicative of an item based on the behavior vector. 

22. The system as defined in claim 18, wherein the 5 32. The system as defined in claim 31, wherein observed 
converter includes a page content vector Lookup module behavior is selected from the group consisting of: submitting 
which identifies the behavior vector based upon a page a query to a web site, requesting a web page, purchasing a 
identifier. product, visiting a merchant, visiting a web site, inquiring 

23. The system as defined in claim 18, wherein the aboutaprochict,watchmgaTVsliow,andwatchmgamovie. 
observed behavior is selected from the group consisting of: 10 33 • ^ system as defined in claim 31, wherein the 
a user query, a page view, or a purchase of a product. computerized system is connected to a network, the network 

24. The system as defined in claim 18, wherein the profile selected from the group consisting of: an intranet, a local 
adapter modifies the profile vector based upon at least one area network, and the Internet . l 
parameter selected from the group consisting of: a threshold M - ^ 35 ^nned in claim 31, wherein the 
bywhichabehaviorvectorwillbeusedinsteadoftheentity 15 c ?! v f rt !J ^eludes a page content vector lookup module 
vector, a learning rate for a profile update, a learning rale for " ^ 3 
an entity update, an update rate for a query universe estimate P %* ^ s ^ tem ^ defined ^ claim 31> ^ nin the 
of a mean, an update for a profile universe update of a mean, observed beh avior is selected from the group consisting of: 
a forgetting factor for a set of profile vectors, a forgetting a ^ qucry> a pagp ^ or a purchasc of a pro duct. 
factor for a set of entity vectors, a mean of a set of entity 20 36. The system as defined in claim 31, wherein the item 
vectors, a mean of a set profile vectors, and a mean of a set ^ selected from the group consisting of: a coupon, an 
of behavior vectors. advertisement, a solicitation, information relating to a 

25. The system as defined in claim 18, wherein the entity product, information relating to a set of services, a page, a 
adapter modifies the entity vector based upon at least one section or a chapter of a book, a document, a newspaper 
parameter selected from the group consisting of: a threshold 25 article, a movie, a TV show, a web site, and a textual 
by which a behavior vector will be used instead of the profile material. 

vector, a leaning rate for a profile update, a learning rate for 37. A system for selecting advertisements in a computer 

an entity update, an update rate for a query universe estimate environment, comprising: 

of a mean, an update for a profile universe update of a mean, a database of electronic advertisements; and 

a forgetting factor for a set of profile vectors, a forgetting 30 an electronic advertisement management system, com- 

factor for a set of entity vectors, a mean of a set of entity prising: 

vectors, a mean of a set profile vectors, and a mean of a set a converter capable of converting an observed behavior 

of behavior vectors. of a user computing device in the computer envi- 

26. A system for generating a profile vector in a computer ronment to a behavior vector, 

environment, comprising: 35 a comparater capable of comparing the behavior vector 

a converter capable of converting a plurality of observed to a plurality of entity vectors, the entity vectors 

behaviors of a user into an associated plurality of indicative of the electronic advertisements, so as to 

behavior vectors; and identify at least one entity vector closely associated 

a profile adapter capable of repeatedly modifying a profile with the observed behavior, and 

vector indicative of the user based on the plurality of 40 a selector accessing the electronic database with the 

behavior vectors. identified entity vector so as to select at least one 

27. The system as defined in claim 26, wherein at least electronic advertisement to communicate to the user 
one of the plurality of observed user behaviors is selected computing device. 

from the group consisting of: submitting a query to a web 38. The system as defined in claim 37, wherein the 

site, requesting a web page, purchasing a product, visiting a 45 computer environment is connected to a network, the net- 

merchant, visiting a web site, inquiring about a product, work selected from the group consisting of: an intranet, a 

watching a TV show, and watching a movie. local area network, and the Internet. 

28. The system as defined in claim 26, wherein the 39. The system as defined in claim 37, wherein the 
computer environment is connected to a network, the net- converter includes a page content vector lookup module 
work selected from the group consisting of: an intranet, a 50 which identifies the behavior vector based upon a page 
local area network, and the Internet. identifier. 

29. The system as defined in claim 26, wherein the 40. The system as defined in claim 37, wherein the 
observed behavior is selected from the group, consisting of: observed behavior is selected from the group consisting of: 
a user query, a page view, or a purchase of a product. a user query, a page view, or a purchase of a product. 

30. The system as defined in claim 26, wherein the profile 55 41. The system as defined in claim 37, wherein the 
adapter includes modifies the profile vector based upon at comparater includes a vector closeness determination mod- 
least one parameter selected from the group consisting of: a ule to calculate the distance between the behavior vector and 
threshold by which a behavior vector will be used instead of any one of the entity vectors. 

the profile vector, a learning rate for a profile update, a 42. The system as defined in claim 37, wherein the 
learning rate for an entity update, an update rate for a query 60 selector includes inventory management to allow selection 
universe estimate of a mean, an update for a profile universe of the entity vector of an entity being under-selected accord- 
update of a mean, a forgetting factor for a set of profile ing to a selection schedule and inhibit the selection of an 
vectors, a forgetting factor for a set of entity vectors, a mean entity that is over-selected according to the selection sched- 
of a set of entity vectors, a mean of a set profile vectors, and ule. 

a mean of a set of behavior vectors. 65 43. The system as defined in claim 37, wherein the 

31. A computerized system for adapting an entity vector, selector includes inventory management to allow selection 
comprising: of the entity based upon a presentation delivery schedule. 
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44. A method of associating an observed behavior with 
items on a computer including a data storage, comprising: 

converting an observed behavior to a behavior vector, 
modifying a profile vector with the behavior vector, and 
comparing the modified profile vector to a plurality of 
entity vectors, the entity vectors indicative of the items, 
so as to identify at least one entity vector closely 
associated with the observed behavior. 

45. The method as defined in claim 44, wherein the 
observed user behavior is selected from the group consisting 
of: submitting a query to a web site, requesting a web page, 
purchasing a product, visiting a merchant, visiting a web 
site, inquiring about a product, watching a TV show, and 
watching a movie. 

46. The method as defined in claim 44, wherein at least 
one of the items is selected from the group consisting of: a 
coupon, an advertisement, a solicitation, information relat- 
ing to a product, information relating to a set of services, a 
page, a section or a chapter of a book, a document, a 
newspaper article, a movie, a TV show, a web site, and a 
textual material 

47. The method as defined in claim 44, wherein the 
converting step identifies the behavior vector based upon a 
page identifier. 

48. The method as defined in claim 44, wherein the 
observed user behavior comprises a user query, and the 
converting step transforms the user query into vectors based 
upon the component words of the user query. 

49. The method as defined in claim 44, wherein the 
comparing step calculates the distance between the modified 
profile vector and any one of the entity vectors, 

50. A method of selecting advertisements in a computer 
environment, comprising: 

providing a database of electronic advertisements; 

converting an observed behavior of a user computing 
device in the computer environment to a behavior 
vector, modifying a profile vector indicative of the user 
with the behavior vector; 

comparing the modified profile vector to a plurality of 
entity vectors, the entity vectors indicative of the elec- 
tronic advertisements, so as to identify at least one 
entity vector closely associated with the observed 
behavior; 

accessing the electronic database with the identified entity 
vector; and 

selecting at least one electronic advertisement to commu- 
nicate to the user computing device. 

51. The method as defined in claim 50, wherein the 
selecting includes selecting an electronic advertisement that 
is under-selected according to a selection schedule and to 
inhibit the selection of an electronic advertisement that is 
over-selected according to the selection schedule. 

52. The method as defined in claim 50, wherein the 
selecting includes selecting the entity vector based upon a 
presentation delivery schedule. 

53. The method as defined in claim 50, wherein the 
modifying includes modifying the profile vector based upon 
at least one parameter selected from the group consisting of: 
a threshold by which a behavior vector will be used instead 
of the entity vector, a learning rate for a profile update, a 
learning rate for an entity update, an update rate for a query 
universe estimate of a mean, an update for a profile universe 
update of a mean, a forgetting factor for a set of profile 
vectors, a forgetting factor for a set of entity vectors, a mean 
of a set of entity vectors, a mean of a set profile vectors, and 
a mean of a set of query vectors. 
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54. The method as defined in claim 50, wherein the 
electronic advertisement is communicated to the user com- 
puting device via a network, the network selected from the 
group consisting of an intranet, a local area network, and the 

5 Internet. 

55. The method as defined in claim 50, wherein the 
observed user behavior is selected from the group consisting 
of: submitting a query to a web site, requesting a web page, 
purchasing a product, visiting a merchant, visiting a web 
site, inquiring about a product, watching a TV show, and 
watching a movie. 

56. The method as defined in claim 50, wherein the 
converting includes identifying the behavior vector based 
upon a page identifier. 

57. The method as defined in claim 50, wherein the 
15 observed user behavior comprises a user query, and the 

converting includes transforming the user query into behav- 
ior vectors based upon the component words of the user 
query. 

58. A method for adapting an entity vector on a computer 
20 including a data storage, comprising: 

converting an observed behavior of a user into a behavior 
vector, 

modifying a profile vector indicative of the user based on 
the behavior vector; and 
25 modifying an entity vector indicative of an item based on 
the profile vector or the behavior vector. 

59. The method as defined in claim 58, wherein at least 
one of the items is selected from the group consisting "of: a 
coupon, an advertisement, information relating to a 

30 solicitation, information relating to a product, a set of 
services, a page, a section or a chapter of a book, a 
document, a newspaper article, a movie, a TV show, a web 
site, and a textual material. 

60. The method as defined in claim 58, wherein the 
35 modifying the profile vector includes modi lying the profile 

vector based upon at least one parameter selected from the 
group consisting of: a threshold by which a behavior vector 
will be used instead of the entity vector, a learning rate for 
a profile update, a learning rate for an entity update, an 

40 update rate for a query universe estimate of a mean, an 
update for a profile universe update of a mean, a forgetting 
factor for a set of profile vectors, a forgetting factor for a set 
of entity vectors, a mean of a set of entity vectors, a mean 
of a set profile vectors, and a mean of a set of behavior 

45 vectors. 

61. The method as defined in claim 58, wherein the 
modifying the entity vector includes modifying the profile 
vector based upon at least one parameter selected from the 
group consisting of: a threshold by which a behavior vector 

50 will be used instead of the profile vector, a learning rate for 
a profile update, a learning rate for an entity update, an 
update rate for a query universe estimate of a mean, an 
update for a profile universe update of a mean, a forgetting 
factor for a set of profile vectors, a forgetting factor for a set 

55 of entity vectors, a mean of a set of entity vectors, a mean 
of a set profile vectors, and a mean of a set of behavior 
vectors. 

62. The method as defined in claim 58, wherein the 
computer is connected to a network, the network selected 

60 from the group consisting of: an intranet, a local area 
network, and the Internet. 

63. The method as defined in claim 58, wherein the 
observed user behavior is selected from the group consisting 
of: submitting a query to a web site, requesting a web page, 

65 purchasing a product, visiting a merchant, visiting a web 
site, inquiring about a product, watching a TV show, and 
watching a movie. 
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64. The method as defined in claim 58, wherein the converting includes transforming the user query into vectors 
converting includes identifying identifies the behavior vec- based upon the component words of the user query. 

tor based upon a page identifier. 77. The method as defined in claim 71, wherein the 

65. The method as defined in claim 58, wherein the modifying step including modifying for the entity vector 
observed user behavior comprises a user query, and the 5 modifies the behavior vector based upon at least one param- 
converting includes transforming the user query into vectors eter selected from the group consisting of: a threshold by 

^ ^ n **™ m V° acnX ™** °l mc which a behavior vector will be used instead of the profile 

66. A method of generating a profile vector in a computer vector fl {tm ^ ^ for fl file u ^ leami ra(e for 

envxronment on a computer including a data storage, com- &Q end d £ for £ ^ J^nate 

prising: 10 r r . 

, , . . . of a mean, an update for a profile universe update of a mean, 

converting a plural^ a forgctting factor for a xi ofptomc vcctors> a forgctting 

an associated plurality of behavior vectors; and r rt , «p * » c ♦ c 

r J ^ tactor for a set of entity vectors, a mean of a set of entity 

repeatedly modifying a profile vector indicative of the vectors, a mean of a set profile vectors, and a mean of a set 

user based on the plurality of behavior vectors. of behavior vectors. 

67. The method as defined in claim 66, wherein the 78 A mcthod of advertisements in a computer 
computer environment is connected to a network, the net- mdudi a data comprising: 

work selected irom the group consisting of: an intranet, a 

local area network, and the Internet. providing a database of electronic advertisements; 

68. The method as defined in claim 66, wherein the converting an observed behavior of a user computing 
observed user behavior is selected from the group consisting 20 device in the computer to a behavior vector; 

of: submitting a query to a web site, requesting a web page, comparing the behavior vector to a plurality of entity 

purchasing a product, visiting a merchant, visiting a web vectors, the entity vectors indicative of the electronic 

site, inquiring about a product, watching a TV show, and advertisements, so as to identify at least one entity 

watching a movie. vector closely associated with the observed behavior, 

69. The method as defined in claim 66, wherein the 25 acccS sing the electronic database with the identified entity 
converting includes identifying the behavior vector based vector, and 

upon a page identifier. , . . ... 

70. The method as defined in claim 66, wherein the « ™ electronic advertsement to comma- 
, , , , . . ' , , nicate to the user computing device. 

observed user behavior compnses a user query and the „ ^ method ^ claim 78 , hc 

converting includes transforming the user query into vectors i t - • , , , . ^ , * . . 

. , „ * t , , * r 4 . selecting includes selecting an electronic advertisement that 

based upon the component words of the user query. • t * j j- . . , , . . 

ti a ~* t u~A ~r a ^ * 15 under-selected according to a selection schedule and 

71. A method of adapting an entity vector on a computer - t . , 4 - r , 4 - , A A , . 
includin a data stora ■ inhibiting the selection of an electronic advertisement that is 

g rag , comprising. over-selected according to the selection schedule, 
converting an observed behavior of a user into a behavior 35 go. The method as defined in claim 78, wherein the 

vector, and selecting includes selecting the entity vector based upon a 

modifying an entity vector indicative of an item based on presentation delivery schedule. 

the behavior vector. 81. The method as defined in claim 78, wherein the 

72. The method as defined in claim 71, wherein at least comparing includes calculating the distance between the 
one of the items are selected from the group consisting of: 40 behavior vectors to any one of the entity vectors. 

a coupon, an advertisement, a solicitation, information relat- 82. The method as defined in claim 78, wherein the at 

ing to a product, information relating to a set of services, a least one electronic advertisement is communicated to the 

page, a section or chapter of a book, a document, a news- user computing device via a network, the network selected 

paper article, a movie, a TV show, a web site, and a textual from the group consisting of: an intranet, a local area 
material. 4S network, and the Internet. 

73. The method as defined in claim 71, wherein the 83. The method as defined in claim 78, wherein the 
computer is connected to a network, the network selected observed user behavior is selected from the group consisting 
from the group consisting of: an intranet, a local area of: submitting a query to a web site, requesting a web page, 
network, and the Internet. purchasing a product, visiting a merchant, visiting a web 

74. The method as defined in claim 71, wherein the 50 site, inquiring about a product, watching a TV show, and 
observed user behavior is selected from the group consisting watching a movie. 

of: submitting a query to a web site, requesting a web page, 84. The method as defined in claim 78, wherein the 

purchasing a product, visiting a merchant, visiting a web converting includes identifying the behavior vector based 

site, inquiring about a product, watching a TV show, and upon a page identifier. 

watching a movie. 55 85. The method as defined in claim 78, wherein the 

75. The method as defined in claim 71, wherein the observed user behavior comprises a user query, and the 
converting includes identifying the behavior vector based converting includes transforming the user query into vectors 
upon a page identifier. based upon the component words of the user query. 

76. The method as defined in claim 71, wherein the 

observed user behavior comprises a user query, and the * * * * * 
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